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Abstract 


The Earth’s land surface boundary conditions in the Goddard Earth Observing System version 5 
(GEOS-5) modeling system were updated using recent high spatial and temporal resolution 
global data products. The updates include: (i) construction of a global 10-arcsec land-ocean- 
lakes-ice mask; (ii) incorporation of a 10-arcsec Globcover 2009 land cover dataset; (iii) 
implementation of Level 12 Pfafstetter hydrologic catchments; (iv) use of hybridized SRTM 
global topography data; (v) construction of the HWSDvl.21-STATSG02 merged global 30 arc- 
second soil mineral and carbon data in conjunction with a highly-refined soil classification 
system; (vi) production of diffuse visible and near-infrared 8-day MODIS albedo climatologies 
at 30-arcsec from the period 2001-2011; and (vii) production of the GEOLAND2 and MODIS 
merged 8-day LAI climatology at 30-arcsec for GEOS-5. The global data sets were pre- 
processed and used to construct global raster data files for the software (mkCatchParam) that 
computes parameters on catchment-tiles for various atmospheric grids. The updates also include 
a few bug fixes in mkCatchParam, as well as changes (improvements in algorithms, etc.) to 
mkCatchParam that allow it to produce tile-space parameters efficiently for high resolution 
AGCM grids. The update process also includes the construction of data files describing the 
vegetation type fractions, soil background albedo, nitrogen deposition and mean annual 2m air 
temperature to be used with the future CatchmentCN model and the global stream channel 
network to be used with the future global runoff routing model. This report provides detailed 
descriptions of the data production process and data file format of each updated data set. 
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1. INTRODUCTION 


In the middle of the last decade, the Catchment Land Surface Model (CLSM: Koster et al, 2000) 
replaced the Mosaic LSM (Koster et al, 1996) as the Goddard Earth Observing System version 5 
(GEOS-5) climate modeling system’s land surface model. CLSM calculates surface energy and 
water budget terms at the Earth’s land surface at every model time step and, in so doing, 
computes surface latent and sensible heat fluxes for the atmospheric model. The use of 
topographically defined hydraulic catchments as fundamental computational units at the surface 
instead of quasi-rectangular cells makes the CLSM somewhat unique among macroscale LSMs. 
The model employs a set of comprehensive state-of-the-art surface hydrologic parameterization 
schemes that require, at each computational element, surface soil hydraulic properties, land cover 
type, albedo, and descriptions of vegetation phenology. 


When the model was first implemented globally about a decade ago, soil, vegetation, and model 
parameters at each computational element were derived using the best available global data sets 
at that time. Below is a description of global data that have been used within the model during 
the last decade: 

1) Simple Biosphere Model 2 land cover classification data from the USGS Global Land 
Cover Characteristics Data Base Version 2.0 (GLCC v2, 2000) aggregated from 30 arc- 
second to 2.5 arc-minute; 

2) HYDRO lk (GTOPO30, 1996) topography data and 36,716 Level 6 Pfafstetter 
catchments comprising Earth’s land surface; 

3) Reynolds (Reynolds et al, 2000) soil texture classes from USGS at 5-arc -minute with 
GSWP-2 soil hydraulic parameters; 

4) AVHRR NDVI-based monthly LAI and greenness fraction climatologies from the period 
1982-1998, provided on a 1° x l°grid by GSWP-2 (Dinneyer and Oki, 2002); 

5) MODIS-based VISDF and NIRDF albedo: 16-day climatologies at 1 arc-minute covering 
the period 2000-2004 (Moody et al., 2008); 

6) GSWP-2 (Dirmeyer and Oki, 2002) depth-to-bedrock data on the 1° x 1° grid. 

Evolution of the computational power of computers, availability of longer records of higher- 
resolution global Earth observation data products based on advanced and newer satellite 
instruments, and more importantly, demand for high resolution modeling and assimilation 
capabilities (e.g., for the GMAO Nature run and for the SMAP mission) necessitated a 
comprehensive update of Land surface Boundary Conditions (LBCs) in GEOS-5. These updates 
include the use of: 

1) The GLOBCOVER 2009 (2011) global land cover classification data at 10 arc-second 
from ESA; 

2) 291,284 Level 12 Pfafstetter catchments to comprise the Earth’s land surface, along with 
topography based on a hybrid of SRTM DEM and other best available global data; 
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3) Merged HWSDvl.21 (HWSD, 2009) and STATSG02 (NRCS, 2012) soil mineral 
component and organic carbon data at 30 arc-second; 

4) Highly refined soil hydraulic properties from De Lannoy et al. (2014); 

5) AVHRR NDVI-based monthly GSWP-2 (Dinneyer and Oki, 2002) greenness fraction 
climatology for the period 1982-1998, spatially interpolated from the 1° grid to 30 arc- 
second; 

6) 8-day, 30-arcsec climatology LAI climatology constructed from GEOLAND2 LAI (Baret 
et ah, 2012 and Camacho et ah, 2013) and MODIS MOD15A2 v005 LAI; 

7) MODIS VISDF and NIRDF (MCD43GF, 2014 and Gao et ah, 2014) albedo 8-day 
climatologies at 30 arc-second for the period 2000-2011; and 

8) GSWP-2 (Dinneyer and Oki, 2002) soil depth-to-bedrock data spatially interpolated from 
the 1° grid to 30 arc-second. 


The Catchment-CN model (Koster et ah, 2014), which employs the canopy conductance and 
carbon physics schemes from NCAR/DOE’s Common Land Model Version 4 (CLM4: Oleson et 
ah, 2010), is a recent addition to GMAO modeling capabilities. The Catchment-CN model 
employs modified CLM land cover types (Catchment-CN types), thereby necessitating the 
generation of a global dataset of these types. A global stream network using topographically 
delineated hydrological catchments was also developed. 


Again, the base description of the Earth’s land surface for GEOS-5 is a mosaic of 
topographically delineated, irregularly shaped hydrologic catchments. Starting from this 
description, GEOS-5 uses two different approaches to define the computational elements that are 
actually used at the land surface. The first approach, typically used when the land model is 
coupled to the atmospheric model, is to overlay the atmospheric grid onto the mosaic of 
irregularly shaped hydrologic catchments and then use the atmosphere’s regular grid lines to sub- 
divide any catchments that straddle adjacent atmospheric cells, so that each new (and smaller) 
land element (or “tile”) sees only one overlying atmospheric cell. (See the left panel of Figure 1 
for an example of a mosaic of catchments further disaggregated by an overlying atmospheric 
grid). The second approach is used for select offline CLSM applications. With this approach, the 
application defines the computational land elements (typically specified to be on a regular grid), 
and the topography data are derived from the underlying dominant hydrologic catchment, i.e., 
the irregular catchment that contributes the largest areal fraction to the defined land element. 
(See the right panel of Figure 1 for one example: the SMAP M36 grid of land surface elements). 


The first of the 2-step process that produces LBCs for GEOS-5 involves pre-processing high 
resolution global data sets, computing climatologies of seasonal variables on native grid pixels, 
and preparing high resolution global raster files for the mkCatchParam software. The second step 
involves running the mkCatchParam software to process those raster data files and aggregate 
variables (or compute dominant vegetation and soil types) to the land model tile space. 

The mkCatchParam software produces vegetation type, vegetation dynamic, soil hydraulic, and 
other model parameters for each computational land element for any given atmospheric grid 
resolution. For convenience, mkCatchParam also generates global map images of all important 
parameters and movies of seasonal parameters. The atmospheric grid resolution-specific data 


7 



files are typically stored in a subdirectory named “clsm” within BCSDIR of the GEOS-5 
experiment. Maps and movies are saved in the “clsm/plots/” directory. The sections below 
provide detailed descriptions of the data production process and of the file format for each 
dataset. 



Figure 1: Illustrations of: (left) land surface elements in the northeast United States for the 
CF0180x6C atmospheric grid, (right) computational grid cells for the SMAP EASEv2 M36 grid. 


2. LAND-OCEAN-LAKES-ICE MASKFILE AND CATCHMENT 
DELINEATION 

First, global 30 arc-second GTOPO30 (see GTOPO30, 1996) topography data were re-gridded to 
10 arc-second and used to construct a mask at 10 arc-second resolution that contains only 3 
surface types: (1) land, (2) lakes, and (3) ocean. Then, glacier locations were overlaid onto the 
dataset using two data sources: (1) Bedmap2 data (Fretwell et ah, 2013) were used over 
Antarctica and everything south of 65°S latitude; and (2) the Randolph Glacier Inventory (RGI) 
Version 4 (Arendt et ah, 2014) data were used to define glaciers north of 65°S latitude. The next 
step was to assign each 10 arc-second land pixel to a watershed whose topographic 
characteristics, namely the statistics of compound topographic index, would be used to 
parameterize the spatial distribution of soil moisture within catchment-tiles, the computational 
land elements at the land surface. 

In independent work, Verdin (2013) used a hybrid of SRTM data and other existing global 
DEMs to delineate Level 12 Pfafstetter hydrologic catchments around the world, producing 
along the way a global 1 arc -minute array of Pfafstetter codes (Verdin and Verdin, 1999). A total 
of 291,254 independent catchments (each with a unique Pfafstetter code) were defined in 
Verdin’s data set. The Level 12 Pfafstetter codes are essentially 12-digit numbers, and these 
large numbers can be computationally challenging to use in catchment-to-grid mapping. 
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Therefore, for convenience, the Level 12 Pfafstetter codes were sorted in ascending order, and 
the corresponding catchments were so indexed from 1 to 291,254. 

Special attention had to be paid to the 30 catchments with the following Pfafstetter codes: 


177552000000, 

177562000000, 

177593500000, 

177647000000, 

177681000000, 

177793900000, 


177553000000, 

177563000000, 

177593600000, 

177648000000, 

177682000000, 

177794100000, 


177554000000, 

177564000000, 

177642300000, 

177649100000, 

177691000000, 

177795200000, 


177555000000, 

177591000000, 

177644100000, 

177660000000, 

177692000000, 

177796100000, 


177561000000, 

177592000000, 

177646000000, 

177678000000, 

177793800000, 

177890000000. 


These 30 catchments are unique in that they straddle the dateline. For logistical reasons, each of 
these 30 catchments was sub-divided at the dateline into two catchments. Catchment index 
numbers were then shifted so that the pair would have adjacent numbers, thereby maintaining the 
ascending order of the Pfafstetter codes. With these additional subdivisions, the Earth’s land 
surface is finally resolved into 291,284 hydrologic catchments. A global 1 arc-minute data array 
of catchment indices (1-291,284) was constructed and then re-gridded into a 10 arc-second data 
array. 


The constructed land-lakes-ice-ocean mask was further modified to produce the final land-lakes- 
ice-ocean mask for the GEOS-5 modeling system. The ocean pixels were assigned a value of 0; 
lake (inland water) pixels were assigned a value of 190000000; and ice pixels were assigned a 
value of 200000000. The above catchment index array was examined against 10 arc-second land 
pixels in the mask array, and land pixels were assigned values equal to the corresponding 
catchment index number. The land pixels that did not have a corresponding catchment index 
were assigned the catchment index number from the “nearest neighbor” pixel in the catchment 
index array. The constructed 10 arc-second land-ocean-lakes-ice mask array was saved in the 
file GEOS5_10arcsec_mask.nc. Note: some of Verdin’s catchments were swallowed into lakes, 
reducing the actual number of hydrologic catchments in the global mask to 290,191. 


3. VEGETATION CLASSIFICATION DATA 

3.1 Data generation and processing chain 

3.1.1 Deriving Mosaic vegetation classes 

The Mosaic model (Koster and Suarez, 1996) utilizes 8 vegetation types. The current version of 
CLSM uses 6 types, with the Mosaic bare soil and desert soil types grouped with Shrubland 
(Table 1). The reduction in type number is explained by the fact that a shrubland type can be 
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made to act like a bare soil or desert type simply by assigning an appropriately low value of leaf 
area index, or LAI, to the surface element. CLSM uses mean seasonal cycles of LAI from global 
data products (see below), thereby ensuring that bare soil and desert soil behavior will appear in 
the correct places, and in any case albedos are forced to match those for diffuse radiation in 
MODIS observations (see below). The absence of the explicit definition of the two bare soil 
types has an insignificant effect on model behavior. 

Global land cover classification at 10-arcsec resolution is available from the European Space 
Agency (see GLOBCOVER 2009). The first step was to make adjustments to GLOBCOVER 
2009 data to match the constructed land-lake-ice-ocean mask in GEOS5_10arcsec_mask.nc in 
Section 2.0. This ensured land, ocean, lake, and ice pixels in the mask will find similar surface 
types in the adjusted GLOBCOVER 2009 data array. The land pixels in GLOBCOVER 2009 
were made consistent with the mask by assigning the nearest neighbor neighbors land cover type 
for the missing and mismatching land pixels in GLOBCOVER 2009 and the modified 
GLOBCOVER 2009 data array was saved in ESAGlobalCover.nc. 

Table 1 shows the mapping scheme employed to convert, at each 10-arcsec pixel, an ESA land 
cover type in ESAGlobalCover.nc into a Mosaic land cover type(s). The Mosaic types used for 
catchment surface elements are detennined by computing the dominant Mosaic type of all 10- 
arcsec pixels within the catchment in question (Figure 2). 


ESA land Corresponding Mosaic 

cover type land cover type 


11 

14 

20 

30 

40 

50 

60 

70 

90 

100 

110 

120 

130 

140 


4 

4 

4 

4 
1 
2 
2 
3 

3 

2 (50%) and 3 (50%) 

2 (30%), 5 (30%), and 4 (40%) 
2 (20%), 5 (20%), and 4 (60%) 

5 

4 
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150 


160 

170 

180 

190 

200 


inside 50°S and 50°N lats: 4 (50%) and 5 (50%) 
outside of 50°S and 50°N lats: 4 (50%) and 6 (50%) 
inside 20°S and 20°N lats: 1 
outside of 20°S and 20°N lats: 2 
1 

4 

4 

5 


Table 1: ESA to Mosaic mapping: The reader is referred to Table 2 of GLOBCOVER 2009 
(2011) for a detailed description of ESA land cover types. The 6 Mosaic land cover types are: 1 
(Broadleaf Evergreen), 2 (Broadleaf Deciduous), 3 (Needleleaf), 4 (Grassland), 5 (Broadleaf 

Shrubs), and 6 (Dwarf trees). 



Figure 2: Mosaic vegetation classes. 
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3.1.2 Deriving Catchment-CN classes 


The Common Land Model version 4 (CLM4: Oleson et al., 2010) utilizes 17 vegetation classes 
(Table 2). A global array of fractional coverage of each of the 17 classes on a 1152x768 grid, as 
used by CLM4 was obtained from UCAR. 

The Catchment-CN model’s vegetation classification is based on that used by CLM4, with 
fractional coverage of each type derived from the Global ESA land cover classification. Each 
ESA type was mapped into one or more of the CLM4 types. For instance, ESA crop types were 
mapped one-to-one into CLM4’s crop type, whereas ESA mosaic vegetation was fractionally 
mapped into region-appropriate grass, shrub, and forest types. This fractional mapping was based 
on the relative fractions found in the original 1152x768 CLM4 gridded arrays and on latitude 
(for differentiating certain types, such as Arctic c3 grass). Bare soil from the ESA land cover 
classification is mapped into the broadleaf deciduous shrub type, since bare soil is not an allowed 
type in our implementation. 

For Catchment-CN, the stress deciduous types (crop and temperate shrubs/grass) utilized by 
CLM4 is replaced by a mix of two sub-types, one that is seasonally deciduous (with a daylight 
trigger) and one that is not. Both sub-types are subject to moisture stress triggers but not to 
temperature (freezing) stress triggers. The removal of the temperature stress trigger eliminated 
unnatural swings in leaf carbon during brief temperature stress senescence (Koster et al. 2014). 
The relative fractions of the two sub-types applied vary linearly with latitude between 32°-42° in 
both hemispheres, with 100% of the stress deciduous type being replaced by the seasonally 
deciduous sub-type at 42° and 100% replaced by the non-seasonally deciduous type at 32°. 

Within each surface element, the two dominant types are identified; the remaining types are 
ignored. (Note, however, that in the latitude band 32°-42°, the presence of two sub-types for a 
potentially dominant type implies that up to four sub-types may be followed.) The vegetation 
fractions for the two dominant types are scaled so that they sum to one. Figure 3 shows global 
maps of primary and secondary Catchment-CN types. 
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Figure 3: Catchment-CN (top) primary vegetation types, and (bottom) secondary vegetation 

types. 
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Land cover 

CLM 

Class 

Catchment- 

CN 

Class 

Map 
Legend 
(Figure 3) 

Bare 

1 

NA 

BARE 

Needleleaf evergreen temperate tree 

2 

1 

NLEt 

Needleleaf evergreen boreal tree 

3 

2 

NLEB 

Needleleaf deciduous boreal tree 

4 

3 

NLDB 

Broadleaf evergreen tropical tree 

5 

4 

BLET 

Broadleaf evergreen temperate tree 

6 

5 

BLEt 

Broadleaf deciduous tropical tree 

7 

6 

BLDT 

Broadleaf deciduous temperate tree 

8 

7 

BLDt 

Broadleaf deciduous boreal tree 

9 

8 

BLDB 

Broadleaf evergreen temperate shrub 

10 

9 

BLEtS 

Broadleaf deciduous temperate shrub 

11 

10 

BLDtS 

Broadleaf deciduous temperate shrub 

NA 

11 

BLDtSm 

[moisture stress only] 

Broadleaf deciduous boreal shrub 

12 

12 

BLDBS 

Arctic c3 grass 

13 

13 

AC3G 

Cool c3 grass 

14 

14 

CC3G 

Cool c3 grass [moisture stress only] 

NA 

15 

CC3Gm 

Wann c4 grass 

15 

16 

WC4G 

Wann c4 grass [moisture stress only] 

NA 

17 

WC4Gm 

Crop 

16 

18 

CROP 

Crop [moisture stress only] 

NA 

19 

CROPm 

Water 

17 

NA 



Table 2: CLM and Catchment-CN land cover classification description. 
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3.1.3 Deriving Nitrogen Deposition, Annual mean 2m Air Temperature, and Soil 
background albedo for the Catchment-CN Model 

Nitrogen deposition data used in CLM4 (Oleson et al., 2010) were obtained from UCAR. The 
approximately quarter degree CLM4 deposition data (sum of NHx and NOx species) were 
spatially interpolated to catchment surface elements and saved (with units of ng m-2 s-1) in the 
“CLM_NDep_SoilAlb_T2m” file (Figure 4, top panel). Note that the units of the input 
deposition data in the Catchment-CN model is g m-2 s- 1 , and thus the values in the file must be 
scaled by 1 .e-9 before using inside the model. 

The Catchment-CN phenology routine computes a "growing degree day" summation to 
determine when seasonal and stress deciduous plant functional types (PFT) become active. The 
critical value for this summation is a function of annual mean 2m air temperature. CLM4 uses 
the previous year's annual mean 2m air temperature (K). In the Catchment-CN model, however, 
we instead use the climatological annual mean 2m air temperature. Two sources of multi-year 
global hydrometeorological forcing were used to compute climatological annual mean 2m air 
temperature separately: 1) GMAO MERRA-2, hourly, 0.625°x0.5° resolution, data from the 
period 1980-2014, and 2) Sheffield et al (2006), 3-hourly, 1-degree data from the period 1948- 
2012. The climatological means were computed on the native forcing grids (see middle and 
bottom panels in Figure 4), then spatially interpolated to catchment surface elements, and finally 
saved in the file “CLM_NDep_SoilAlb_T2m”. 

In areas of low LAI (less than one), surface albedo in the Catchment-CN model is prescribed 
using MODIS soil background albedo. For LAI=0, the soil background albedo has full weight; at 
LAI=1, the albedo is fully determined by mapped catchment vegetation type (using subroutine 
SIB ALB). A linear ramp is used to weight between the MODIS soil background albedo and the 
SIB ALB albedo for 0<LAI<1. There are four albedo components: visible direct (VISDR), visible 
diffuse (VISDF), near-infrared direct (NIRDR), and near-infrared diffuse (NIRDF). Global fields 
of soil background VISDR, VISDF, NIRDR and NIRDF at 3-arcmin resolution were obtained 
from Houldcroft et al. (2009) and spatially aggregated to catchment surface elements for the 
Catchment-CN model (Figure 5). 
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2 1 

Figure 4: (top) Nitrogen deposition (sum of NHx and NOx species) [ng m' s' ], (middle) Mean 
annual 2m air temperature from MERRA-2 [K] for the period 1980-2014, (bottom) Mean annual 
2m air temperature from Sheffield et al. (2006) data [K] for the period 1948-2012. 
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Figure 5: Global maps of Houldcroft et al. (2009) soil background albedo data [-]. 


3.1.4 Deriving Canopy Height Data 

Global 30-arcsec canopy height data, used (in some implementations) in the calculation of 
surface roughness, were obtained from NASA’s Jet Propulsion Laboratory (Simrad et al., 2011). 
These heights were spatially aggregated to catchment surface elements (Figure 6). 
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JPL : Canopy Height 



Figure 6: Map of canopy height data [m]. 


3.2 Data files and images 


3.2.1 Mosaic vegetation types and fractions 


file name: mosaic_veg_typs_f racs 


do n = 1, NTILES 

read ( 10 , * ' ' ) tile_index, pf af_code, 

primary veg type, secondary_veg_type, 
primary_veg_f rac, secondary_veg_f rac, canopy_height 
end do 

where for each tile: 


tile_index [-] 
pfaf_code [-] 


primary_veg_type [-] 
secondary_veg_type [-] 
primary_veg_f rac [-] 
secondary_veg_f rac [-] 
canopy_height [m] 


Tile Number 

catchment index (1-291284) after 
sorting Pfafstetter codes in 
ascending order 
primary vegetation type 
secondary vegetation type 
primary vegetation fraction 
secondary vegetation fraction 
canopy height 
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3.2.2 CLM and Catchment-CN vegetation types and fractions 


file name: CLM_veg_typs_f racs 
do n = 1, NTILES 

read (10, ' (218, 413, 4f7 .2, 213, 2f7 .2) ' ) & 

tile^index, pf af_code, & 

CLM-C_ptl, CLM-C_pt2, CLM-C_stl, CLM-C_st2, & 
CLM-C_pf 1, CLM-C_pf2, CLM-C_sf 1, CLM-C_sf2, & 
CLM_pt, CLM_st , CLM_pf, CLM_sf 

enddo 

Where for each tile: 


tile index 

[-] 

pfaf code 

[-] 

CLM-C ptl 

[-] 

CLM-C pt2 

[-] 

CLM-C stl 

[-] 

CLM-C st2 

[-] 

CLM-C pfl 

[-] 

CLM-C pf 2 

[-] 

CLM-C sfl 

[-] 

CLM-C sf 2 

[-] 

CLM_pt [-] 


CLM st [-] 


CLM_pf [-] 


CLM sf [-] 



Tile Number 

catchment index (1-291284) after sorting 
Pfafstetter codes in ascending order 
Catchment-CN primary type 1 
Catchment-CN primary type 2 (if sub-types 

exist, in 32°-42° latitudinal band) 
Catchment-CN secondary type 1 

Catchment-CN secondary type 2 (if sub-types 

exist, in 32°-42° latitudinal band) 
Catchment-CN fraction of 1st primary type 
Catchment-CN fraction of 2nd primary type 
(if sub-types exist, in 32°-42° latitudinal 
band) 

Catchment-CN fraction of 1st secondary type 
Catchment-CN fraction of 2nd secondary type 
(if sub-types exist, in 32°-42° latitudinal 
band) 

CLM primary type 
CLM secondary type 
CLM fraction of primary type 
CLM fraction of secondary type 


3.2.3 Nitrogen Deposition, Annual mean 2m Air Temperature, and Soil background 
albedo for the Catchment-CN Model 


file name: CLM_NDep_SoilAlb 
do n = 1, NTILES 

read (10, ' (fl0.4, 4f7 .4, 2f8.3) ' ) & 
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NDEP, VISDR, VISDF, NIRDR, NIRDF, T2_M, T2_S 

enddo 

Where for each tile: 


NDEP [ngm-2s-l] 
VISDR [-] 

VISDF [-] 

NIRDR [-] 

NIRDF [-] 

T2_M [K] 

T2 S [K] 


Nitrogen deposition 

Direct visible soil background albedo 
Diffuse visible soil background albedo 
Direct near-infrared soil background 
albedo 

Diffuse near-infrared soil background 
albedo 

Mean annual 2m air temperature from MERRA- 
2 (averaged over 1980-2014) 

Mean annual 2m air temperature from 
Sheffield et al . (2006) (averaged over 

1980-2014) 


4. TOPOGRAPHY AND SOIL DATA 


4.1 Data generation and processing chain 


Verdin (2013) produced global 1-arcmin raster arrays of Compound Topographic Index (CTI) 
statistics: mean, standard deviation and skewness. As described in Section 2, Verdin (2013) also 
delineated the Earth’s land surface into 291,254 hydrologic catchments, and sub-dividing 30 
dateline catchments increased the total number of catchments to 291,284. Furthermore, Verdin 
(2013) produced an associated global 1 arc-minute raster array of Level 12 Pfafstetter codes to 
map those hydrologic catchments. The 1 arc-minute data array was also updated to accommodate 
the changes stemming from the 30 dateline catchments. 

The following approach was employed to derive statistics of CTI for each of the 291,284 
catchment using the available CTI statistics at 1 arc -minute resolution. First, the N pixels 
belonging to a given catchment were identified. For each of these pixels, a 5000-element array 
of sample CTIs was constructed using a 3-parameter gamma distribution; the 5000-element array 
was constructed so as to have the same mean, standard deviation, and skewness as was identified 
for the pixel in the Verdin (2013) dataset. If the Verdin (2013) data did not provide a skewness 
value for the pixel, a Gaussian distribution was used to construct the 5000-element CTI array. 

Second, the 5000 elements from each of the N pixels making up the given catchment were 
combined to form a single, 5000 x N-element array of CTIs. Finally, the mean, standard 
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deviation, and skewness of the CTI values in this single larger array were computed and assigned 
to the catchment. Figure 4 shows the resulting global maps of catchment-level CTI statistics. 

The user is referred to De Lannoy et al. (2014) for a complete description of the procedure used 
to generate our global datasets of soil type. The basis of the datasets is a merging of STATSG02 
(NRCS, 2012) data with the Flarmonized World Soil Data (HWSD, 2009) version 1.21 dataset, 
with special techniques employed to fdl data gaps. The merging process produced global arrays 
of sand, clay and organic matter percentages at 30 arc-second resolution for two soil layers: 0- 
30cm and 30-100cm. From these data, De Lannoy et al. (2014) defined 253 soil classes, and 
from these classes they derived soil hydraulic properties for the surface and profile layers, 
separately (Table 3). The representative (and thus utilized) soil type for a given hydrological 
catchment was determined through a somewhat complex procedure developed by De Lannoy et 
al. (2014); the upshot of the procedure is that the chosen type actually appears within the 
catchment and has, compared to all of the 30 arc-second pixels in the catchment, roughly median 
texture percentages. The surface soil hydraulic properties for the 253 classes appear in Table 3. 
See Figure 5 for corresponding global maps of soil hydraulic properties. 

The Second Global Soil Wetness Project (Dinneyer, and Oki, 2002) provided a global, 1 -degree 
dataset of soil depth to bedrock. The GSWP-2 soil depth data were spatially interpolated onto 30- 
arcsec pixels. The interpolated soil depth data were averaged across a given catchment land 
element to detennine that catchment’s effective soil depth. 


4.2 Data files and images 

4.2.1 Tile types, location, area and Pfafstetter catchment mapping 

file name: BCSDIR/BCRSLV/BCRSLV-Pf af stetter . til 
The 8-line header contains: (line 1) N_GLOBAL, number of 

columns in the global raster array of tile indices, number 
of rows in the global raster array of tile indices; (line 
2) "2" depicts that all land, ocean, lakes, and ice tiles 
are included in the table below; (line 3) locations of the 
poles (PE is for pole edge grid, and PC is pole center 
grids) , grid resolution and location of the dateline for 
finite volume atmospheric grids (CF for cube-sphere grids; 
DE is for finite element grids with the dateline at the 
western edge of the grid, and DC is for finite element 
grids with the dateline lying along the center of the first 
column); (line 4) number of columns in the atmospheric grid 
(AGCM_IM) ; (line 5) number of rows in the atmospheric grid 
(AGCM_JM) ; (line 6) locations of the poles (PE or PC) , grid 
resolution and location of the dateline (DE or DC) for the 
ocean grid ; (line 7) number of columns in the ocean grid 
(OGCM_IM) ; and (line 8) number of rows in the ocean grid 
(OGCM_JM) . The 8-line header is followed by a table that 
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has N_GLOBAL number of rows containing tile geographic 
information for each tile separately. 

do n = 1,N_GL0BAL ! (including ocean, glaciers, and lakes) 
read (10,*) type, area, longitude, latitude, & 
ig, jg, cell_frac, pfaf_index, pfaf_code, pfaf_frac 
end do 

Note: the below description is valid only for the land 
block of the table, i.e. for rows whose surface type is 100 
in the first column. The ocean, lakes and ice tiles have 
their own indexing and tile specific fractions while the 
format of the read statement remains unchanged. 

For each tile: 


type [-] 


tile type (100-land; 0-ocean; 19- 
lakes; 20-ice) 

longitude 

[degree] 

longitude at the centroid of the tile 

latitude 

[degree] 

latitude at the centroid of the tile 

ig [-] 


i-index of the AGCM grid cell where 
the tile is located (1 - AGCM IM) 

jg [-1 


j -index of the AGCM grid cell where 
the tile is located (1 - AGCM JM) 

pf af index [ - ] 

catchment index (1-291,284) after 
sorting Pfafstetter codes in ascending 
order 

cell frac 

[-] 

fraction of the AGCM grid cell covered 
by the tile 

pfaf frac 

[-] 

fraction of the Pfafstetter catchment 
covered by the tile 


4.2.2 Western, eastern, southern, and northern edges and mean elevation of tiles 

file name: catchment . def 

read (10,*) NTILES 
do n = 1, NTILES 

read (10, ' (i8, i8, 5 (2x, f 9 . 4) ) ' ) tile_index, pf af^index, 
min_lon, max_lon, min_lat , max_lat , mean_elevation ) 

end do 

where for each tile: 
tile_index [-] Number 

pfaf_index [-] catchment index (1-291,284) after 
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min_lon [degree] 
max_lon [degree] 
min_lat [degree] 
max_lat [degree] 
mean elevation [m] 


sorting Pfafstetter codes in ascending 
order 

Westernmost edge of tile 
Easternmost edge of tile 
Southernmost edge of tile 
Northernmost edge of tile 
area-averaged elevation of tile 


4.2.3 Tile topography - statistics of Compound Topographic Index (CTI) 

file name: cti_stats.dat 

read (10,*) NTILES 
do n = 1, NTILES 

read (10, ' (i8, i8, 5 (lx, f8 . 4) ) ' ) tile_index, pf af_index, 

& 

cti^mean, cti_std, cti_min, cti^max, cti_skew 
end do 

where for each tile: 


tile index 

Number [-] 

pfaf index 

catchment index (1-291284) after sorting 
Pfafstetter codes in ascending order [-] 

cti mean 

mean CTI of the underlying hydrologic 
catchment [log(m)] 

cti std 

standard deviation of CTI of the 
underlying hydrologic catchment 

cti min 

minimum CTI value in the underlying 
hydrologic catchment 

cti max 

maximum CTI value in the underlying 
hydrologic catchment 

cti skew 

skewness of CTI of the underlying 
hydrologic catchment 


4.2.4 Soil Parameters 

file name: soil_param.dat 
do n = 1, NTILES 

read (10, ' (i8,i8,i4,i4, 3f8.4, fl2 .8, f7 .4, fl0.4, 3f7 .3, 4f7 .3, 

& 

2fl0.4)') tile^index, pfaf_code, soil_class_top, & 
soil_class_com, BEE, PSIS, POROS, COND, WPWET, & 
soildepth, gravel, OrgCarbon_top, OrgCarbon_rz , & 
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sand_top, clay_top, sand_rz, clay_rz, WPWET_top, & 
POROS_top 
end do 

where for each tile: 


tile index [-] 

Number 

pfaf index [-] 

catchment index (1-291,284) after 
sorting Pfafstetter codes in ascending 
order 

soil class top [-] 

soil class for the surface layer 
( 0-30cm) 

soil class com [-] 

soil class for the root-zone layer 
(0-100cm) 

BEE [-] 

b-parameter of the tension curve 

PSIS [m H20] 

air entry pressure for the root-zone 
(matric potential) 

POROS [m3/m3] 

soil moisture content at saturation for 
the root-zone 

COND (m/s) 

saturated hydraulic conductivity at the 
surface 

WPWET [-] 

ratio of wilting point to porosity for 
the root-zone 

soildepth [mm] 

depth to bedrock 

gravel [vol% ] 

percentage of gravel in the surface 
layer (0-30cm) 

OrgCarbon top [w %] 

percentage of organic carbon in the 
surface layer (0-30cm) 

OrgCarbon rz [w %] 

percentage of organic carbon in the 
root zone (O-lOOcm) 

sand top [w %] 

percentage of sand in the surface layer 
( 0-30cm) 

clay top [w%] 

percentage of clay in the surface 
layer (0-30cm) 

sand rz [w %] 

percentage of sand in the root-zone 
layer (0-100cm) 

clay rz [w %] 

percentage of clay in the root-zone 
layer (0-100cm) 

WPWET top [-] 

ratio of wilting point to porosity for 
the surface layer (0-30cm) 

POROS top [m3/m3] 

soil moisture content at saturation in 
the surface layer (0-30cm) 
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Figure 7: (top) Mean of Compound Topographic Index (CTI) in catchment space, (middle) 
standard deviation of CTI, and (bottom) skewness of CTI. 
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Figure 8 : Select soil hydraulic parameters for the CF0360x6C model tile space: (top left) b 
parameter (the parameter describing the shape of the water retention curve), (top right) air entry 
pressure, \\r s (m FFO), (middle left) porosity (m Vm '), (middle right) hydraulic conductivity (ms -1 ) 
at the surface, (bottom left) wilting point wetness (-), and (bottom right) soil depth-to-bedrock 

(mm). 
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Class 

Cl 

Sa 

OC 

6s * 

Pb 

e s 

b 


K s 

wp 

fc 


[ w %] 

[ w %] 

[ w %] 

[ m 3 / m 3 ] 

[ g / cm 3 ] 

[ m 3 / m 3 ] 

[-1 

[ m ] 

[ m / s ] 

[ m 3 / m 3 ] 

[ m 3 / m 3 ] 

1 

53.33 

43.33 

0.26 

0.47 

1.32 

0.47 

4.94 

- 0.28 

2 . 10E-05 

0.130 

0.28 

2 

56.67 

36.67 

0.26 

0.49 

1.26 

0.50 

5.41 

- 0.40 

1 . 42E-05 

0.165 

0.33 

3 

53.33 

33.33 

0.26 

0.48 

1.27 

0.50 

6.21 

- 0.37 

9 . 85E-06 

0.189 

0.35 

4 

56.67 

26.67 

0.26 

0.50 

1.21 

0.52 

6.01 

- 0.54 

8 . 84E-06 

0.205 

0.39 

5 

53.33 

23.33 

0.26 

0.50 

1.22 

0.52 

6.33 

- 0.50 

7 . 34E-06 

0.211 

0.39 

6 

56.67 

16.67 

0.26 

0.52 

1.16 

0.55 

5.91 

- 0.74 

6 . 84E-06 

0.222 

0.42 

7 

53.33 

13.33 

0.26 

0.52 

1.17 

0.54 

6.08 

- 0.67 

5 . 81E-06 

0.222 

0.42 

8 

56.67 

6.67 

0.26 

0.54 

1.10 

0.57 

5.54 

- 1.00 

5 . 45E-06 

0.228 

0.45 

9 

53.33 

3.33 

0.26 

0.54 

1.12 

0.56 

5.66 

- 0.90 

4 . 56E-06 

0.225 

0.44 

10 

43.33 

53.33 

0.26 

0.43 

1.43 

0.42 

5.02 

- 0.15 

2 . 09E-05 

0.107 

0.23 

11 

46.67 

46.67 

0.26 

0.44 

1.38 

0.45 

5.81 

- 0.19 

1 . 41E-05 

0.143 

0.28 

12 

43.33 

43.33 

0.26 

0.44 

1.38 

0.45 

6.42 

- 0.20 

9 . 60E-06 

0.161 

0.29 

13 

46.67 

36.67 

0.26 

0.46 

1.33 

0.48 

6.60 

- 0.26 

8 . 61E-06 

0.181 

0.32 

14 

43.33 

33.33 

0.26 

0.46 

1.34 

0.47 

6.68 

- 0.27 

7 . 01E-06 

0.183 

0.32 

15 

46.67 

26.67 

0.26 

0.48 

1.29 

0.49 

6.66 

- 0.35 

6 . 56E-06 

0.198 

0.35 

16 

43.33 

23.33 

0.26 

0.47 

1.30 

0.49 

6.57 

- 0.36 

5 . 45E-06 

0.195 

0.35 

17 

46.67 

16.67 

0.26 

0.49 

1.24 

0.51 

6.43 

- 0.46 

5 . 15E-06 

0.208 

0.38 

18 

43.33 

13.33 

0.26 

0.49 

1.25 

0.50 

6.30 

- 0.47 

4 . 21E-06 

0.201 

0.37 

19 

46.67 

6.67 

0.26 

0.51 

1.19 

0.53 

6.07 

- 0.61 

3 . 97E-06 

0.212 

0.40 

20 

43.33 

3.33 

0.26 

0.50 

1.21 

0.52 

5.93 

- 0.60 

3 . 15E-06 

0.204 

0.39 

21 

33.33 

63.33 

0.26 

0.40 

1.50 

0.39 

4.45 

- 0.11 

2 . 13E-05 

0.077 

0.18 

22 

36.67 

56.67 

0.26 

0.41 

1.47 

0.41 

5.41 

- 0.13 

1 . 40E-05 

0.112 

0.23 

23 

33.33 

53.33 

0.26 

0.41 

1.47 

0.42 

5.77 

- 0.14 

9 . 41E-06 

0.125 

0.24 

24 

36.67 

46.67 

0.26 

0.43 

1.43 

0.43 

6.23 

- 0.17 

8 . 31E-06 

0.146 

0.27 

25 

33.33 

43.33 

0.26 

0.42 

1.44 

0.43 

6.09 

- 0.19 

6 . 64E-06 

0.145 

0.27 

26 

36.67 

36.67 

0.26 

0.44 

1.40 

0.45 

6.39 

- 0.23 

6 . 16E-06 

0.163 

0.30 

27 

33.33 

33.33 

0.26 

0.44 

1.40 

0.45 

6.08 

- 0.26 

5 . 02E-06 

0.157 

0.29 

28 

36.67 

26.67 

0.26 

0.45 

1.36 

0.46 

6.29 

- 0.31 

4 . 72E-06 

0.173 

0.32 

29 

33.33 

23.33 

0.26 

0.45 

1.37 

0.46 

5.92 

- 0.34 

3 . 77E-06 

0.164 

0.31 

30 

36.67 

16.67 

0.26 

0.46 

1.32 

0.48 

6.06 

- 0.40 

3 . 56E-06 

0.179 

0.34 

31 

33.33 

13.33 

0.26 

0.46 

1.34 

0.47 

5.68 

- 0.44 

2 . 75E-06 

0.168 

0.33 

32 

36.67 

6.67 

0.26 

0.48 

1.28 

0.49 

5.76 

- 0.51 

2 . 59E-06 

0.182 

0.35 

33 

33.33 

3.33 

0.26 

0.47 

1.30 

0.48 

5.39 

- 0.56 

1 . 93E-06 

0.170 

0.34 

34 

23.33 

73.33 

0.26 

0.39 

1.54 

0.37 

3.45 

- 0.10 

2 . 28E-05 

0.045 

0.14 

35 

26.67 

66.67 

0.26 

0.39 

1.52 

0.39 

4.41 

- 0.11 

1 . 45E-05 

0.075 

0.18 

36 

23.33 

63.33 

0.26 

0.39 

1.52 

0.40 

4.55 

- 0.13 

9 . 59E-06 

0.084 

0.19 

37 

26.67 

56.67 

0.26 

0.40 

1.50 

0.41 

5.15 

- 0.15 

8 . 21E-06 

0.106 

0.22 

38 

23.33 

53.33 

0.26 

0.40 

1.50 

0.41 

4.87 

- 0.18 

6 . 45E-06 

0.102 

0.22 

39 

26.67 

46.67 

0.26 

0.41 

1.48 

0.42 

5.36 

- 0.20 

5 . 84E-06 

0.121 

0.25 

40 

23.33 

43.33 

0.26 

0.41 

1.48 

0.42 

4.94 

- 0.24 

4 . 66E-06 

0.112 

0.24 

41 

26.67 

36.67 

0.26 

0.42 

1.45 

0.43 

5.34 

- 0.27 

4 . 30E-06 

0.130 

0.27 
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42 

23.33 

33.33 

43 

26.67 

26.67 

44 

23.33 

23.33 

45 

26.67 

16.67 

46 

23.33 

13.33 

47 

26.67 

6.67 

48 

23.33 

3.33 

49 

13.33 

83.33 

50 

16.67 

76.67 

51 

13.33 

73.33 

52 

16.67 

66.67 

53 

13.33 

63.33 

54 

16.67 

56.67 

55 

13.33 

53.33 

56 

16.67 

46.67 

57 

13.33 

43.33 

58 

16.67 

36.67 

59 

13.33 
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2 . 94 E -05 

0.028 

0.12 

218 

16.67 

76.67 

1.12 

0.42 

1.43 

0.41 

3.51 

- 0.12 

1 . 77 E -05 

0.053 

0.16 

219 

13.33 

73.33 

1.12 

0.43 

1.41 

0.42 

3.56 

- 0.15 

1 . 20 E -05 

0.060 

0.18 
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220 

16.67 

66.67 

1.12 

0.43 

1.41 

0.42 

4.16 

- 0.15 

9 . 81E-06 

0.081 

0.20 

221 

13.33 

63.33 

1.12 

0.43 

1.39 

0.43 

3.85 

- 0.19 

7 . 86E-06 

0.076 

0.20 

222 

16.67 

56.67 

1.12 

0.43 

1.39 

0.43 

4.39 

- 0.20 

6 . 83E-06 

0.095 

0.23 

223 

13.33 

53.33 

1.12 

0.44 

1.38 

0.44 

3.94 

- 0.25 

5 . 54E-06 

0.086 

0.23 

224 

16.67 

46.67 

1.12 

0.44 

1.37 

0.44 

4.43 

- 0.26 

4 . 94E-06 

0.105 

0.25 

225 

13.33 

43.33 

1.12 

0.44 

1.37 

0.44 

3.93 

- 0.32 

3 . 92E-06 

0.092 

0.24 

226 

16.67 

36.67 

1.12 

0.45 

1.35 

0.45 

4.38 

- 0.33 

3 . 54E-06 

0.111 

0.26 

227 

13.33 

33.33 

1.12 

0.45 

1.35 

0.45 

3.86 

- 0.41 

2 . 71E-06 

0.096 

0.26 

228 

16.67 

26.67 

1.12 

0.45 

1.33 

0.45 

4.28 

- 0.42 

2 . 46E-06 

0.114 

0.28 

229 

13.33 

23.33 

1.12 

0.45 

1.33 

0.45 

3.75 

- 0.52 

1 . 81E-06 

0.099 

0.27 

230 

16.67 

16.67 

1.12 

0.46 

1.30 

0.46 

4.13 

- 0.52 

1 . 65E-06 

0.117 

0.29 

231 

13.33 

13.33 

1.12 

0.46 

1.31 

0.46 

3.62 

- 0.64 

1 . 16E-06 

0.101 

0.29 

232 

16.67 

6.67 

1.12 

0.47 

1.28 

0.47 

3.97 

- 0.63 

1 . 05E-06 

0.117 

0.31 

233 

13.33 

3.33 

1.12 

0.47 

1.28 

0.46 

3.47 

- 0.78 

7 . 11E-07 

0.101 

0.30 

234 

3.33 

93.33 

1.12 

0.46 

1.32 

0.43 

1.78 

- 0.18 

3 . 82E-05 

0.010 

0.08 

235 

6.67 

86.67 

1.12 

0.44 

1.36 

0.42 

2.38 

- 0.16 

2 . 20E-05 

0.024 

0.12 

236 

3.33 

83.33 

1.12 

0.45 

1.33 

0.44 

2.38 

- 0.21 

1 . 45E-05 

0.027 

0.14 

237 

6.67 

76.67 

1.12 

0.44 

1.37 

0.43 

2.84 

- 0.19 

1 . 14E-05 

0.041 

0.16 

238 

3.33 

73.33 

1.12 

0.45 

1.35 

0.44 

2.58 

- 0.25 

8 . 87E-06 

0.037 

0.16 

239 

6.67 

66.67 

1.12 

0.44 

1.37 

0.44 

3.00 

- 0.25 

7 . 40E-06 

0.051 

0.18 

240 

3.33 

63.33 

1.12 

0.45 

1.35 

0.44 

2.65 

- 0.32 

5 . 85E-06 

0.043 

0.18 

241 

6.67 

56.67 

1.12 

0.44 

1.37 

0.44 

3.05 

- 0.32 

5 . 03E-06 

0.058 

0.20 

242 

3.33 

53.33 

1.12 

0.44 

1.36 

0.44 

2.65 

- 0.41 

3 . 89E-06 

0.047 

0.20 

243 

6.67 

46.67 

1.12 

0.44 

1.37 

0.44 

3.04 

- 0.41 

3 . 40E-06 

0.062 

0.22 

244 

3.33 

43.33 

1.12 

0.44 

1.36 

0.44 

2.61 

- 0.53 

2 . 54E-06 

0.050 

0.22 

245 

6.67 

36.67 

1.12 

0.44 

1.36 

0.44 

2.98 

- 0.52 

2 . 25E-06 

0.065 

0.24 

246 

3.33 

33.33 

1.12 

0.44 

1.36 

0.44 

2.54 

- 0.68 

1 . 61E-06 

0.053 

0.24 

247 

6.67 

26.67 

1.12 

0.45 

1.35 

0.44 

2.90 

- 0.67 

1 . 43E-06 

0.068 

0.25 

248 

3.33 

23.33 

1.12 

0.45 

1.35 

0.44 

2.46 

- 0.88 

9 . 81E-07 

0.054 

0.26 

249 

6.67 

16.67 

1.12 

0.45 

1.34 

0.44 

2.80 

- 0.84 

8 . 76E-07 

0.069 

0.27 

250 

3.33 

13.33 

1.12 

0.45 

1.34 

0.44 

2.38 

- 1.11 

5 . 74E-07 

0.056 

0.28 

251 

6.67 

6.67 

1.12 

0.46 

1.32 

0.45 

2.69 

- 1.04 

5 . 13E-07 

0.070 

0.29 

252 

3.33 

3.33 

1.12 

0.45 

1.33 

0.44 

2.28 

- 1.38 

3 . 21E-07 

0.056 

0.30 

253 

n/a 

n/a 

> 8.72 

n/a 

n/a 

0.80 

3.41 

- 1.76 

7 . 86E-07 

0.216 

0.66 


Table 3: Highly refined soil classes and soil hydraulic properties where for the 0-3 0cm top soil 
layer: Cl is the percentage clay ; Sa is the percentage sand; OC is the organic carbon content; 0 S 
is the first approximation of soil moisture content at saturation that was used in pedotransfer 
functions to derive hydraulic properties; Pb is the soil bulk density; 0 S is the soil moisture 
content at saturation; b is a parameter describing the shape of the water retention curve; ¥ s is the 
matric potential at saturation; K s is the hydraulic conductivity at saturation; wp is the wilting 
point; fc is the field capacity (adapted from De Lannoy et ah, 2014). 
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5. SEASONALLY-VARYING VEGETATION DATA 


5.1 Data generation and processing chain 


The Second Global Soil Wetness Project (GSWP-2) provided monthly Leaf Area Index (LAI) 
and Greenness Fraction (GrnFrac) data on a l°xl° grid for the period 1982-1998. A monthly 
climatology of GrnFrac was computed from these data by temporally averaging over the 17-year 
period (by month) on the l°xl° grid and then spatially interpolating the averages onto 30-arcsec 
pixels. The interpolated GrnFrac data were aggregated over the pixels of each land element to 
derive a monthly GrnFrac climatology for that land element. 

Global, 10-day averaged LAI data on a 40320 x 20160 grid for the period 1999-2011 are available 
from GEOLAND2 (Baret et al., 2012 and Camacho et el. 2013). In addition, 8-day composites of 
MOD 1 5 A2 v005 MODIS LAI data (MODIS, 2008) are available at 30-arcsec (43200x21600) 
for the period 2000-2013. Preprocessing of the two datasets showed that each had potential 
flaws, with GEOLAND2 showing questionable seasonal cycles in Siberia, and MODIS showing 
questionable values over the rain forests. We thus decided to produce a merged LAI data product 
for GEOS-5 to avoid these potential deficiencies. 

The first step in generating the merged product was computing a 10-day climatology of 
GEOLAND LAI at each GEOLAND2 pixel from the 13 years of GEOLAND2 data and then 
spatially aggregating the pixel-based climatologies to surface elements. The next step was 
computing the corresponding 8-day MODIS-based LAI climatology for each surface element 
from the 14-years of MODIS data. To do this, we used MODIS auxiliary data on surface type to 
fill in LAI values at certain land pixels as follows: barren, rock, and desert surface types were 
given LAI values of 0.01, urban-built areas were given values of 0.5; and marshland areas were 
given values obtained from nearest neighbor pixels. The filled-in MODIS data had to be 
projected from a sinusoidal grid to a regular latitude/longitude grid prior to the calculation of the 
pixel-based 8-day climatologies and the subsequent spatial aggregation of the pixel climatologies 
to surface element climatologies. 

Note that in high latitudes, both MODIS and GEOLAND2 data are not available over large areal 
swaths. To address this, we constructed, at every time slice (8 days for MODIS and 10 days for 
GEOLAND2), a l°xl° global gridded LAI dataset by spatially aggregating the finer resolution 
LAI climatological data. Missing LAI values in the finer resolution datasets were filled with the 
value for the ‘nearest neighbor’ on the l°xl° global grid. 

To merge the GEOLAND2 and MODIS data into a single product of 8-day LAI climatologies, 
the 10-day GEOLAND2 climatological seasonal cycle at each element was first time- 
interpolated to the 8-day cycle used by MODIS. MODIS LAI data were then selected for all 
surface elements except those in South America, Africa and Australia, which took their values 
from the GEOLAND2 LAI dataset. Figure 9 shows global maps of climatological mean monthly 
of LAI for the merged LAI data. 
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5.2 Data files and movies 

5.2.1 Greenness Fraction [-] 

file name : green.dat (or . . /green_clim* . data) 
file format: fortran binaries, little_endian 
The read statement should be MAPL_ReadForcing compatible. 
Each data record is preceded by a header containing 
the start and end times of the period over which the 
data have been averaged. Corresponding start dates for 
climatological data records are: 1201 0101 0201 0301 
0401 0501 0601 0701 0801 0901 1001 1101 1201 0101 
(MMDD: where MM is month and DD is day of month) 

Reading data: Loop over the next two lines until the last 
data record is read. 

read (10) Year_Begin, Month_Begin, Day_Begin, Hour_Begin, 
Minute_Begin, Secs_Begin, Year_End, Month_End, 

Day_End, Hour_End, Minute_End, Secs_End (Float 
Numbers ) 

Figure 9: Mean monthly Leaf Area Index (LAI) for the merged data. 
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read(10) (data (n) , n=l, NTILES) 


5.2.2 Leaf Area Index (LAI) [m 2 /m 2 ] 

file name : lai.dat (or . . /lai_clim* . data) 
file format: fortran binaries, little_endian 
The read statement should be MAPL_ReadForcing compatible. 
Each data record is preceded by a header containing 
the start and end times of the period over which the 
data have been averaged. Corresponding start dates for 
climatological data records are: 1227 0101 0109 0117 
0125 0202 0210 0218 0226 0306 0314 0322 0330 0407 0415 

0423 0501 0509 0517 0525 0602 0610 0618 0626 0704 0712 

0720 0728 0805 0813 0821 0829 0906 0914 0922 0930 1008 

1016 1024 1101 1109 1117 1125 1203 1211 1219 1227 0101 

(MMDD: where MM is month and DD is day of month) 
Reading data: Loop over the next two lines until the last 
data record is read. 

read (10) Year_Begin, Month_Begin, Day_Begin, Hour_Begin, 
Minute_Begin, Secs_Begin, Year_End, Month_End, 

Day_End, Hour_End, Minute_End, Secs_End (Float 
Numbers ) 

read(10) (data (n) , n=l , NTILES) 


6. SURFACE ALBEDO DATA 


6.1 Data generation and processing chain 


The Catchment Land Surface Model (CLSM) computes, at each model time step, the following 
quantities: (1) visible (0.3-0. 7pm) direct albedo (black sky), (2) near-infrared (0.7-5. 0pm) direct 
albedo, (3) visible (0.3-0. 7pm) diffuse albedo (white sky), and (4) near-infrared (0.7-5. 0pm) 
diffuse albedo. Initial diurnally-varying values are first computed using an albedo scheme 
(Koster and Suarez 1991) based on the two-stream approximation utilized by SiB (Sellers et al. 
1986). These values are then scaled so that their 8-day averages agree with a MODIS-based 
albedo climatology. 

To compute the scaling factors, 30-arcsec 8-day composites of MODIS (MCD43GF, 2014 and 
Gao et al., 2014) diffuse visible (VISDF) and diffuse near-infrared data (NIRDF) from the period 
2001-2011 were temporally averaged into an 8-day climatology. These 30-arcsec climatological 
values were then spatially averaged over a given land surface element’s pixels to produce an 8- 
day climatology for the land element as a whole. 
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Meanwhile, the SiB-based albedo scheme was run at a daily time step over a 1-year period using 
the vegetation types, greenness fractions, and leaf area indices established for GEOS-5 for a 
given distribution of land elements, as described in sections 3.2.1, 5.2.1, and 5.2.2 above. 
Averaging the visible diffuse and near-infrared diffuse albedos generated by the scheme over 8- 
day periods produced, in effect, an 8-day ‘climatology’ of this particular scheme’s diffuse 
albedos. The ratio of the MODIS-based 8-day diffuse visible albedo to the SiB-based diffuse 
visible albedo at a given surface element serves as the 8-day ‘scaling factor’ for that element. 
During a full simulation, the time-step values of both the visible direct and visible diffuse 
albedos computed with the SiB-based scheme are multiplied by the diffuse-based scale factor 
(for the given element and given 8-day period) prior to being applied to the incoming radiation 
values. The same approach is used to compute the scale factors for the near-infrared albedos. 


6.2 Data files and movies 

6.2.1 MODIS Albedo Climatology [Diffuse, Visible (0.3-0.7|im) and Near-Infrared 
(0.7-5.0|um)] 

Note: CLSM does not use MODIS albedo data explicitly. 

file names : AlbMap . WS . 8-day . tile . 0 . 3_0 . 7 . dat/AlbMap . WS . 8- 
day . tile . 0 . 7_5 . 0 . dat 

file format: fortran binaries, little_endian 

The read statement should be MAPL_ReadForcing compatible. 
Each data record is preceded by a header containing 
start and end times of the period over which the data 
have been averaged. Corresponding start dates for 
climatological data records are: 1227 0101 0109 0117 
0125 0202 0210 0218 0226 0306 0314 0322 0330 0407 0415 

0423 0501 0509 0517 0525 0602 0610 0618 0626 0704 0712 

0720 0728 0805 0813 0821 0829 0906 0914 0922 0930 1008 

1016 1024 1101 1109 1117 1125 1203 1211 1219 1227 0101 

(MMDD: where MM is month and DD is day of year) 

Reading data: Loop over the next two lines until the last 
data record is read. 

read (10) Year_Begin, Month_Begin, Day_Begin, Hour_Begin, 
Minute_Begin, Secs_Begin, Year_End, Month_End, 

Day_End, Hour_End, Minute_End, Secs_End (Float 
Numbers ) 

read(10) (data (n) , n=l , NTILES) 
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6.2.2 MODIS Scale Parameters [Diffused, Visible (0.3_0.7) and Near-Infrared 

( 0 . 7 _ 5 . 0 )] 


file names : visdf . dat /nirdf . dat (or . ./visdf*dat and 
. . / nirdf Mat ) 

file format: fortran binaries, little_endian 
The read statement should be MAPL_ReadForcing compatible. 
Each data record is preceded by a header containing 
start and end times of the period over which the data 
have been averaged. Corresponding start dates for 
climatological data records are: 1227 0101 0109 0117 
0125 0202 0210 0218 0226 0306 0314 0322 0330 0407 0415 

0423 0501 0509 0517 0525 0602 0610 0618 0626 0704 0712 

0720 0728 0805 0813 0821 0829 0906 0914 0922 0930 1008 

1016 1024 1101 1109 1117 1125 1203 1211 1219 1227 0101 

(MMDD: where MM is month and DD is day of year) 

Reading data: Loop over the next two lines until the last 
data record is read. 

read (10) Year_Begin, Month_Begin, Day_Begin, Hour_Begin, 
Minute_Begin, Secs_Begin, Year_End, Month_End, 

Day_End, Hour_End, Minute_End, Secs_End (Float 
Numbers ) 

read(10) (data (n) , n=l, NTILES) 


7. CATCHMENT LAND SURFACE MODEL PARAMETERS 


7.1 Data generation and processing chain 


The Catchment LSM utilizes numerous preprocessed parameters, many of which describe ‘fits’ 
to the results of highly complex calculations. For efficiency purposes, these fits are used in place 
of the complex calculations themselves during simulations. The parameters derived for each 
surface element rely in part on the statistics of compound topographic index in "cti_stats.dat" 
(Section 4.2.3) and the soil hydraulic properties in "soil_param.dat" (Section 4.2.4). The user is 
referred to Ducharne et al. (2000) for a description of the CLSM parameter generation process 
and for definitions of the parameters themselves. 


7.2 Data files 

In the descriptions below, equation and figure numbers refer to those in Ducharne et al. (2000). 
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7.2.1 Time scale parameters for moisture transfer between surface excess and root 
zone excess prognostic variables 
file name : tau_param.dat 
do n = 1, NTILES 

read (10, ' (i8, i8, 4f 10 . 7) ' ) tile^index, pfaf_index, 
atau2, btau2, atau5, btau5 
end do 
where : 

(1) gnu vertical transmissivity parameter {v in Equation 
(8)} [m-1] 


(2) 

atau2 

3-t2 

in 

Equation 

(17) 

for 

a 

2 cm 

surface 

layer 

[-] 

(3) 

btau2 

b X 2 

in 

Equation 

(17) 

for 

a 

2 cm 

surface 

layer 

[-] 

(4) 

atau5 

a X 2 

in 

Equation 

(17) 

for 

a 

5 cm 

surface 

layer 

[-] 

(5) 

btau5 

a X 2 

in 

Equation 

(17) 

for 

a 

5 cm 

surface 

layer 

[-] 


6.2.2 Time scale parameters for moisture transfer between root zone excess and 

catchment deficit prognostic variables 

file name : ts.dat 
do n = 1, NTILES 

read (10, ' (i8,i8,f5.2,4(2x,el3.7)) ') tile_index, 
pfaf_index, gnu, tsal, tsa2, tsbl, tsb2 
end do 

where : 

(1) gnu vertical transmissivity parameter {v in Equation 

(8)} [m-1] 

(2) tsal a T i in Equation (16) for positive root zone excess 

(Ducharne et al . , 2000: Figure 6) [-] 

(3) tsa2 a T i in Equation (16) for negative root zone excess 

(Ducharne et al . , 2000: Figure 6) [-] 

(4) tsbl b x i in Equation (16) for positive root zone excess 

(Ducharne et al . , 2000: Figure 6) [-] 

(5) tsb2 b-tiin Equation (16) for negative root zone excess 

(Ducharne et al . , 2000: Figure 6) [-] 

7.2.3 Baseflow parameters 

file name : bf.dat 
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do n = 1, NTILES 

read (10, ' (i8,i8,f5.2,3(2x,el3.7)) ') tile_index, 
pfaf_index, gnu, bfl, bf2, bf3 


end 

do 





where : 





(1) 

gnu 

vertical transmissivity 

parameter 

{v in Equation 


(8) 

} [m-1] 




(2) 

bfl 

A in Equation (9) 

[kg 

m-4] 


(3) 

bf 2 

B in Equation (9) 

[m] 



(4) 

bf 3 

XBAR in Equation (8) 

(same as cti 

mean in Section 


4.2.2) [ log (m) ] 


7.2.4 Area fractioning parameters 

file name : ar.new 
do n = 1, NTILES 

read (10, ' (i8,i8,f5.2,ll(2x,el3.7)) ') tile_index, pfaf_index, 
gnu, arsl, ars2, ars3, aral,ara2, ara3, ara4, arwl, 
arw2, arw3, arw4 


end 

do 







where : 







(1) 

gnu 

vertical transmissivity parameter {v in Equation 


(8) } 

[m-1] 






(2) 

arsl 

A in Equation 

(12) 

for A sat [m+2 

kg-1] 



(3) 

ars2 

B in Equation 

(12) 

for A sat [m+2 

kg-1] 



(4) 

ars3 

C in Equation 

(12) 

for A sat [m+4 

kg-2 ] 



(5) 

aral 

A in Equation 

(14) 

of segmentl 

if skewness 

< 

0.25 


[m+2 

kg-1] | (else 

aral 

= ara3) 




(6) 

ara2 

B in Equation 

(14) 

of segmentl 

if skewness 

< 

0.25 


[-] 

I (else 

ara2 

= ara4) 




(7) 

ara3 

A in Equation 

(14) 

of segment2 

if skewness 

< 

0.25 


[m+2 

kg-1] 






(8) 

ara4 

[-] 

arwl 

B in Equation 

(14) 

of segment2 

if skewness 

< 

0.25 

(9) 

A in Equation 

(12) 

for 0 O [m+2 

kg-1] 



(10) 

t arw2 

B in Equation 

(12) 

for ©o [m+2 

kg-1] 



(11) 

i arw3 

C in Equation 

(12) 

for ©o [m+4 

kg-2] 



(12) 

i arw4 

Yoo in Equation 

(12) 

for @o [-] 
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8. GLOBAL RUNOFF ROUTING MODEL DATA 


8.1 Data generation and processing chain 


The Pfafstetter codification (Verdin and Verdin, 1999) assigns a unique multi-digit integer to a 
given hydrologic catchment within a river basin. The multi-digit integer, or Pfafstetter code, 
contains information about connectivity of the catchment with upstream and downstream 
catchments; considering all of the catchments’ codes together allows the construction of a 
catchment network within the basin. Verdin (2013) provided global raster arrays of global Level 
12 Pfafstetter codes at 1-arcmin resolution along with information on mean elevation. These 
data sets were used to build the global river channel network slated for use with GEOS-5 (see, 
e.g, Figure 10). 



Figure 10: Illustration of river channel network in South America. 
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The steps used to generate the river network are as follows. Each catchment (referred to in this 
discussion as CatchX) has typically zero or two upstream catchments (although some catchments 
may have one or more than two, as discussed below) but only one downstream catchment (or 
only one sink, as an ocean or a lake). The Pfafstetter code helps locate the downstream 
catchment. Once it is identified, CatchX automatically becomes one of the upstream catchments 
for the downstream catchment. During the Pfafstetter codification process, sometimes islands 
can be incorrectly linked across the ocean. Sinks also often get incorrectly linked with 
catchments geographically far apart. Thus, a land-water mask at 1 arc-minute resolution was 
created using the 10 arc-second mask (Section 2) created to help identify islands and coastal 
catchments. Islands, coastal catchments and sinks were given due attention to ensure that they 
don’t get linked up incorrectly. The 1 arc -minute elevation data were used to determine the 
lowest point (outlet) of each catchment. 

Once the upstream and downstream catchments for CatchX are identified, the next step is to find 
the latitudinal and longitudinal coordinates of its upstream and downstream confluences. This is 
complicated by the fact that the codification at 1-arcmin is imperfect - in a perfect system, a 
catchment would only have zero or two upstream catchments, but because of the discretization to 
1-arcmin and the associated loss of higher resolution information, a catchment might end up 
having (according to the discretized codification) one or possibly more than two upstream 
catchments. To determine the locations of the upstream confluences, we must therefore consider 
four possible cases: 

Case 1: CatchX has no upstream catchments. In this case, the centroid of CatchX is used 
as the location of the upstream confluence, for purposes of computing river length and 
elevation difference. 

Case 2: There is only one upstream catchment. The nearest 1-arcmin CatchX pixel to the 
centroid of the upstream catchment is assumed to be the upstream confluence. 

Case 3: Exactly 2 upstream catchments are present. The meeting point of CatchX and the 
two upstream catchments is determined and assigned to be the location of the upstream 
confluence. 

Case 4: There are more than 2 upstream catchments. For each 1-arcmin pixel within 
CatchX, the sum of the distances between the pixel and the centroids of each upstream 
catchment is computed. The pixel with the minimum sum-of-distances is assumed to be 
located at the upstream confluence. 

To determine the location of the downstream confluence, only two cases need to be considered: 
Case 1 : There is a downstream catchment. The location of the downstream confluence of 
CatchX is taken to be the same as that of the upstream confluence for that downstream 
catchment. 

Case 2: There is no downstream catchment. A 1-arcmin CatchX pixel located next to a 
water pixel is assumed to be the location of the downstream confluence. 


8.2 Data files 
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8.2.1 Pfafstetter watershed connectivity, channel information 


file name: /discover/nobackup/smahanam/l_data/ 

LandBCs_f iles_f or_mkCatchParam/ SRTM-TopoData/ 

Pfaf catch- routing . dat 
read (10,*) NPfafs 

do n = 1, NPfafs 

read (10, ' (i8,il5,4(lx,f9.4),lx,el0.3,3(lx,e9.3),I8,6(lx,f9. 
4))') pfaf_index, pfaf_code, min_lon, max_lon, 
min_lat, max_lat, mean_elevation, cat_area, length, 
ElevDiff, dnst_pf af^index, DN_long, DN_lat, UP_lon, 
UP_lat, mouth_lon, mouth_lat 
end do 


pfaf_index catchment index (1-291284) after sorting 

Pfafstetter codes in ascending order [-] 


pf af_code 
[ - ] 

min_lon 

catchment 

max_lon 

catchment 

min_lat 

catchment 

max_lat 

catchment 


Pfafstetter code of the hydrologic catchment 


Longitude of westernmost edge of 
[degree] 

Longitude of easternmost edge of 
[degree] 

Latitude of southernmost edge of 
[degree] 

Latitude of northernmost edge of 
[degree] 


the 


the 


the 


the 


mean elevation Area-averaged elevation [m] 


cat_area 

Length 

ElevDiff 

DN_pfaf_index 

DN_long 

DN_lat 

UP_lon 

UP_lat 
mouth_lon 
mouth lat 


catchment area [km2] 

Distance between upstream and downstream 
confluences [km] 

Elevation difference between upstream and 
downstream confluences [m] 

catchment index of the downstream catchment 
(Note : -1 depicts a sink or lake/ocean) [-] 
longitude at the downstream confluence 
[degree] 

latitude at the downstream confluence 
[degree] 

longitude at the upstream confluence 
[degree] 

latitude at the upstream confluence [degree] 
longitude at the river mouth [degree] 
latitude at the river mouth [degree] 
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8.2.2 Fractional areas to aggregate from SMAP grid cells to Pfafstetter watersheds 

Because surface catchment elements for AGCM grids are derived by overlaying AGCM 
grids on the Pfafstetter watershed map, information in the BCRSLV-Pfafstetter.til file 
(Section 4.2. 1) can be used to convert variables between computational surface elements 
and Pfafstetter watersheds. However, quasi-rectangular SMAP grid cells are completely 
independent of Pfafstetter watershed boundaries. The self-describing SMAP- 
Catch TransferData.nc file contains the infonnation needed to convert between SMAP 
grid cells and watersheds. The information includes the number of Pfafstetter watersheds 
contributing to the SMAP grid cell and the fractional areas of those contributing 
watersheds. 

file name : SMAP-Catch Transf erData . nc 


where : 


NCats_in_SMAP 

Pf af_Index 

Pf af_Area [ km 2 ] 
Pfaf Frac [-] 


No. of pfaf catchments contributing to the 
SMAP cell 

Pfaf indices (1-291,284) of those 
contributing catchments 
Area of the Pfaf Catchment fraction 
Fraction of Pfaf catchment contributing to 
the SMAP cell 
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APPENDIX A: mkCatchParam Software 


Usage: mkCatchParam -x nx -y ny -g Gridname -b DL -m MA -1 LD -s 
SD -e EASE 

-x Size of longitude dimension of input raster. DEFAULT: 8640 

-y Size of latitude dimension of input raster. DEFAULT: 4320 

-g Gridname (name of the .til or . rst file without file 
extension) 

-b Position of the dateline in the first grid box (DC or DE) . 
DEFAULT: DC 

-1 Choice of LAI data set (Default MODGEO) 

MODGEO: MODIS with GEOLAND2 overlaid on South America, 
Africa and Australia 

GEOLAND2 : 10-day climatology from the period 1999-2011 on 
40320x20160 grid 

GSWP2 : Monthly climatology from the period 1982-1998 
on 360x180 grid 

MODIS: 8-day climatology from the period 2000-2013 on 
43200x21600 grid 

GSWPH: Monthly climatology from the 2D-spatially 

interpolated GSWP-2 LAI for the period 1982-1998 on 
43200x21600 grid 

-s Choice of soil data (Default HWSD) 

HWSD : Merged HWSD1 . 2 1 -STATSG02 soil properties on 

43200x21600 grid, with De Lannoy (2014) soil hydraulic 
parameters 

NGDC : Reynolds soil texture classes on 4320x2160 grid with 
GSWP2 soil hydraulic parameters 
-m Choice of MODIS Albedo data (Default MODIS2) 

MODIS1: 16-day Climatology from l'xi' (21600x10800) MODIS 
data from the period 2000-2004 
MODIS2: 8-day Climatology from 0.5'x0.5' (43200x21600) MODIS 
data from the period 2001-2011 
-e EASE: This option is only used for SMAP grids 


APPENDIX B: Miscellaneous programs 


(Bl) Deriving parameters for user-specified quasi-rectangular 

grid cells (for e.g., LIS, GSWP-2) either globally or over 
a small domain 

USAGE : bin/mkLISTilesPara -vfile VectorFile 
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The user-specified ASCII file (VectorFile) should contain a one- 
line header followed by latitude and longitude at the center of 
each grid cell in a row separately for each grid cell. The one- 
line header reads: 

N_Cells, dx,dy, DL, NC_domain, NR_domain, i_offset, j_offset 
where : 

Four mandatory parameters : 

N_Cells number of grid cells in the study 

dx cell resolution in longitudinal direction in degrees 

dy cell resolution in latitudinal direction in degrees 

DL Location of the dateline 

0: the dateline lies along the center of the 1st 
column (for e.g., DC in GEOS-5 finite volume grids) 

1: the dateline is along the western edge of the grid 

Four optional parameters: 

NC_domain* Number of columns in the domain 
NR_domain* Number of rows in the domain 

i_offset* i index of the lower left grid cell of the domain 

minus 1 when the domain grid is overlaid on a global grid 
of the same resolution counting columns from the dateline 
to the dateline 

j_offset* j index of the lower left grid cell of the domain 

minus 1 when the domain grid is overlaid on a global grid 
of the same resolution counting rows from the south pole to 
the north pole 

Note that the last four optional parameters are only used to 
derive parameters for a smaller rectangular study domain. 
They must be left blank if parameters are being derived for 
quasi-rectangular surface elements on a global grid. Note 
also that if the domain expands to the globe, the expanded 
grid must have the dateline and the south pole as the 
western and southern edges. 

(B2) Deriving parameter files for SMAP-EASE grids 
USAGE : bin/mkSMAPTiles_v2 -smap_grid MXX 
Allowed SMAP grids are: M01 M03 M09 M36 

(3) Software to debug parameters Section 7.2.4 

This utility helps debug the curve-fitting algorithm that 

derives arsl, ars2, ars3, aral,ara2, ara3, ara4, arwl, 
arw2, arw3, arw4 in Section 7.2.4. 


47 



Usage : chk_clsm_params -s Y -m MaskFile 

-m GE0S5_1 Oarcsec_mask . nc for new mask or just -m for older 
mask 

The user will be asked to enter statistics regarding the 
compound topographic index and the soil hydraulic 
properties in the problematic catchment surface element. 
The hope is that no one will ever need to use this 
software. The user is advised to consult the GMAO land 
modeling team should a curve-fitting problem ever arise 
(rather than attempting to debug the problem himself or 
herself) . 


APPENDIX C: Acronyms 

AGCM 

Atmospheric General Circulation Model 

AVHRR 

Advanced Very High Resolution Radiometer 

BCSDIR 

Boundary conditions directory as specified in gem run.j in the GEOS-5 
experiment 

BCRSLV 

Resolution as specified in gcmrun.j 

CLM 

Common Land Model 

CLSM 

Catchment Land Surface Model 

DEM 

Digital Elevation Model 

ESA 

European Space Agency 

GEOS-5 

Goddard Earth Observing System version 5 

GMAO 

Global Modeling and Assimilation Office 

GmFrac 

Greenness Fraction 

GSWP2 

The second phase of the Global Soil Wetness Project 

HWSDvl.21 

Harmonized World Soil Data version 1.21 

HYDRO lk 

Global coverage of stream and drainage basins derived using USGS 30 
arc-second digital elevation model (DEM) 

IGBP 

International Geosphere Biosphere Project 
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LAI 

Leaf Area Index 

LBCs 

Land surface Boundary Conditions 

MERRA 

Modem Era Retrospective analyses for Research and Applications 

MODIS 

Moderate Resolution Imaging Spectroradiometer 

NDVI 

Normalized Difference Vegetation Index 

NIRDF 

Near-infrared Diffused Albedo 

NTILES 

Number of catchment tiles (surface elements) that make up the Earth’s 
land surface in the data set 

SiB2 

Simple Biosphere model version 2 

SMAP 

NASA Soil Moisture Active Passive mission 

SRTM 

Shuttle Radar Topography Mission 

STATSG02 

The Digital General Soil Map of the United States 

UCAR 

University Corporation for Atmospheric Research 

USGS 

Unites State Geological Survey 

VISDF 

Visible Diffused Albedo 
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